[INFERENCE PROVIDERS] guide on structured output #1824
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks!
```yaml
- local: guides/building-first-app
  title: Building Your First AI App
- local: guides/structured-output
  title: Structured Outputs with LLMs
```
I would align the toctree title with the doc title (or vice versa) to avoid confusion for users when referencing the doc
Shouldn't the guides be under the provider list?
I think we want to focus on providers more.
(forgot to post this on the previous PR @burtenshaw sorry =)
No worries. I'll make another PR with the menu change.
@@ -0,0 +1,467 @@

# Structured Outputs with Inference Providers

In this guide, we'll show you how to use Inference Providers to generate structured outputs that follow a specific JSON schema. This is incredibly useful for building reliable AI applications that need predictable, parseable responses.
Suggested change:
```diff
- In this guide, we'll show you how to use Inference Providers to generate structured outputs that follow a specific JSON schema. This is incredibly useful for building reliable AI applications that need predictable, parseable responses.
+ In this guide, we'll show you how to use Inference Providers to generate structured outputs that follow a specific JSON schema. This is incredibly useful for building reliable AI applications that need predictable, parsable responses.
```
We'll create a simple schema that captures the most essential elements: the paper's title and a summary of its abstract. The easiest way to do this is to use Pydantic, a library that allows you to define Python classes that represent JSON schemas (among other things).
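As a rough sketch (not necessarily the PR's exact code), the model might look like the following; the class and field names `PaperAnalysis`, `title`, and `abstract_summary` are taken from the attributes referenced later in this review:

```python
from pydantic import BaseModel

# Sketch of the schema described above: a paper's title plus a
# summary of its abstract. Field names follow the attributes
# referenced elsewhere in this guide.
class PaperAnalysis(BaseModel):
    title: str
    abstract_summary: str
```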
<hfoptions id="json-pydantic">
I wouldn't use the toggles here since it sets up as a kind of "choose one or the other". After showing the Pydantic code example, maybe transition with:
"The Pydantic represented JSON schema is shown below."
</hfoption>
The OpenAI client returns a `ChatCompletion` object, which contains the response from the model as a Python object. You can then access the structured data using the `title` and `abstract_summary` attributes of the `PaperAnalysis` class.
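A hedged sketch of that round trip, assuming an OpenAI-compatible structured-output payload; the exact model ID, `base_url`, and `response_format` shape are assumptions here, not code taken from this PR:

```python
import os

from openai import OpenAI

# Assumed client setup (see Step 2 below); base_url is an assumption.
client = OpenAI(
    base_url="https://router.huggingface.co/v1",
    api_key=os.environ["HF_TOKEN"],
)

# Constrain the model's output to the PaperAnalysis JSON schema using the
# OpenAI-compatible structured-output convention.
completion = client.chat.completions.create(
    model="Qwen/Qwen3-32B",  # model from the PR description; exact ID is an assumption
    messages=[{"role": "user", "content": "Summarize this paper: <abstract text>"}],
    response_format={
        "type": "json_schema",
        "json_schema": {
            "name": "PaperAnalysis",
            "schema": PaperAnalysis.model_json_schema(),
        },
    },
)

# The ChatCompletion's message content is a JSON string matching the schema;
# validate it back into a PaperAnalysis instance and read the attributes.
analysis = PaperAnalysis.model_validate_json(completion.choices[0].message.content)
print(analysis.title)
print(analysis.abstract_summary)
```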
This should go under the <hfoption id="openai"> toggle
Cool doc page! Is there one about tool calling coming up as well? :)
Note: @SBrandeis has been building a validation matrix of model/provider support for structured output & function calling; we could dynamically display it on this page (or link to it from this page).
cc @gary149 too for viz
Yep. It's coming tomorrow.
👍
## Step 2: Set up your inference client
Now that we have our schema defined, let's set up the client to communicate with the inference providers. We'll show you two approaches: the Hugging Face Hub client (which gives you direct access to all Inference Providers) and the OpenAI client (which works through OpenAI-compatible endpoints).
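A minimal sketch of both setups, assuming the Cerebras provider named in the PR description; the router `base_url` for the OpenAI-compatible endpoint is an assumption:

```python
import os

from huggingface_hub import InferenceClient
from openai import OpenAI

# Option 1: the Hugging Face Hub client, which talks to an Inference
# Provider directly (provider name taken from the PR description).
hf_client = InferenceClient(provider="cerebras", api_key=os.environ["HF_TOKEN"])

# Option 2: the OpenAI client pointed at an OpenAI-compatible endpoint
# (base_url is an assumption, not confirmed by this PR).
openai_client = OpenAI(
    base_url="https://router.huggingface.co/v1",
    api_key=os.environ["HF_TOKEN"],
)
```

Both clients expose the same `chat.completions.create(...)` style of call, so the structured-output request shown earlier works with either.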
Sadge - it's only Python 🥲
True. I'll come back to that on all new guides.
Very useful!!
Co-authored-by: Steven Liu <[email protected]>
…ace/hub-docs into guide-structured-output
This PR adds a guide on structured outputs with Inference Providers. It uses the Hugging Face Hub client, the OpenAI client, Cerebras as the provider, and Qwen3 32B as the model.